Learning Spatio-Temporal Structure from RGB-D Videos for Human Activity Detection and Anticipation

نویسندگان

Hema Swetha Koppula

Ashutosh Saxena

چکیده

We consider the problem of detecting past activities as well as anticipating which activity will happen in the future and how. We start by modeling the rich spatio-temporal relations between human poses and objects (called affordances) using a conditional random field (CRF). However, because of the ambiguity in the temporal segmentation of the sub-activities that constitute an activity, in the past as well as in the future, multiple graph structures are possible. In this paper, we reason about these alternate possibilities by reasoning over multiple possible graph structures. We obtain them by approximating the graph with only additive features, which lends to efficient dynamic programming. Starting with this proposal graph structure, we then design moves to obtain several other likely graph structures. We then show that our approach improves the state-of-the-art significantly for detecting past activities as well as for anticipating future activities, on a dataset of 120 activity videos collected from four subjects.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Hand Gesture Recognition from RGB-D Data using 2D and 3D Convolutional Neural Networks: a comparative study

Despite considerable enhances in recognizing hand gestures from still images, there are still many challenges in the classification of hand gestures in videos. The latter comes with more challenges, including higher computational complexity and arduous task of representing temporal features. Hand movement dynamics, represented by temporal features, have to be extracted by analyzing the total fr...

متن کامل

ZHENHENG YANG, JIYANG GAO, RAM NEVATIA: SPATIO-TEMPORAL ACTION DETECTION WITH CASCADE PROPOSAL AND LOCATION ANTICIPATION1 Spatio-Temporal Action Detection with Cascade Proposal and Location Anticipation

In this work, we address the problem of spatio-temporal action detection in temporally untrimmed videos. It is an important and challenging task as finding accurate human actions in both temporal and spatial space is important for analyzing large-scale video data. To tackle this problem, we propose a cascade proposal and location anticipation (CPLA) model for frame-level action detection. There...

متن کامل

Spatio-Temporal Action Detection with Cascade Proposal and Location Anticipation

متن کامل

Physically Grounded Spatio-temporal Object Affordances

Objects in human environments support various functionalities which govern how people interact with their environments in order to perform tasks. In this work, we discuss how to represent and learn a functional understanding of an environment in terms of object affordances. Such an understanding is useful for many applications such as activity detection and assistive robotics. Starting with a s...

متن کامل

Extreme Learning Machine for Large-Scale Action Recognition

In this paper, we describe the method we applied for the action recognition task on the THUMOS 2014 challenge dataset. We study human action recognition in RGB videos through low-level features by focusing on improved trajectory features that are densely extracted from the spatio-temporal volume. We represent each video with Fisher vector encoding and additional mid-level feautures. Finally, we...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2013

Learning Spatio-Temporal Structure from RGB-D Videos for Human Activity Detection and Anticipation

نویسندگان

چکیده

منابع مشابه

Hand Gesture Recognition from RGB-D Data using 2D and 3D Convolutional Neural Networks: a comparative study

ZHENHENG YANG, JIYANG GAO, RAM NEVATIA: SPATIO-TEMPORAL ACTION DETECTION WITH CASCADE PROPOSAL AND LOCATION ANTICIPATION1 Spatio-Temporal Action Detection with Cascade Proposal and Location Anticipation

Spatio-Temporal Action Detection with Cascade Proposal and Location Anticipation

Physically Grounded Spatio-temporal Object Affordances

Extreme Learning Machine for Large-Scale Action Recognition

عنوان ژورنال:

اشتراک گذاری